Reversibility reconsidered: finite-state factors for efficient probabilistic sampling in parsing and generation
نویسندگان
چکیده
We restate the classical logical notion of generation/parsing reversibility in terms of feasible probabilistic sampling, and argue for an implementation based on finite-state factors. We propose a modular decomposition that reconciles generation accuracy with parsing robustness and allows the introduction of dynamic contextual factors.
منابع مشابه
Finite-state subset approximation of phrase structure
We describe a method and a software tool to approximate and manipulate phrase structure grammars by a string representation of derivation trees and an encoding of a finite automaton that recognizes such strings. Many linguistically natural extensions to phrase structure grammars can be modeled on top of the approximation, allowing for a generic mechanism to model parsing and generation of a var...
متن کاملInherently Reversible Grammars, Logic Programming And Computability
This paper a t tempts to clarify two distinct notions of "reversibility": (i) Uniformity of implementation of parsing and generation, and (it) reversibility as an inherent (or intrinsic) property of grammars. On the one hand, we explain why grammars specified as definite programs (or the various related "unification grammars") lead to uniformity of implementation. On the other hand, we define d...
متن کاملStochastic Inversion Transduction Grammars, with Application to Segmentation, Bracketing, and Alignment of Parallel Corpora
We introduce (1) a novel stochastic inversion transduction grammar formalism for bilingual language modeling of sentence-pairs, and (2) the concept of bilingual parsing with potential application to a variety of parallel corpus analysis problems. The formalism combines three tactics against the constraints that render finite-state transducers less useful: it skips directly to a context-free rat...
متن کاملRegular Approximation as a Heuristics for A* Parsing
Parsing probabilistic context-free grammars generated from treebanks can be made more efficient by employing heuristics to reduce the search space. Klein and Manning (2003) applied A* search to parsing and achieved a huge efficiency gain using several search estimates which rely on grammar transformation and context summaries. We review ideas that have been published and propose a new estimate ...
متن کاملEvaluation of Finite State Morphological Analyzers Based on Paradigm Extraction from Wiktionary
Wiktionary provides lexical information for an increasing number of languages, including morphological inflection tables. It is a good resource for automatically learning rule-based analysis of the inflectional morphology of a language. This paper performs an extensive evaluation of a method to extract generalized paradigms from morphological inflection tables, which can be converted to weighte...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2015